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(54) VIDEO IMAGE SUMMARIZING METHOD AND DEVICE THEREFOR 

(57)Abstract: 

PURPOSE: To reduce a reproduction time by avoiding unnaturality in sound to be 
produced when a fast viewing is realized without changing a reproduction speed of 
a video image. 

CONSTITUTION: A video image received by a video image input section 101 in 
real time is fed to a video image reproduction control section 103 while being 
stored in a video storage section 102. When the user commands a reproduction 
method (1) from an interface section 109a cut-point detection section 104 
detects a cut-point of the video image. Then a low power detection section 105 
detects points at which power of audio information is reduced around the cut 
points and points at which the power of the audio information gets lower after and 
before the former points. Then a reproduction time is adjusted and buffering for a 
video image block is madea video output section 108 reproduces the video image 
from points at which the power of the audio information gets lower around each 
cut-point till the points of succeeding audio information and the reproduced image 
is outputted to a display device of the interface section 109. 



CLAIMS 



[Claim(s)] 

[Claim 1]As opposed to said video information photoed in accumulated video 
information or real time which consists of picture information and speech 
informationAn image summarizing method which reproduces an image between a 
point that asked for two or more cut points when a scene of a picture changes 
from said picture informationand power of speech information became low in the 



circumference of these each cut pointand a point that power of speech 
information became low before and behind thatone by one. 
[Claim 2]As opposed to said video information photoed in accumulated video 
information or real time which consists of picture information and speech 
informationAn image summarizing method which reproduces an image from a point 
that asked for two or more cut points when a scene of a picture changes from 
said picture informationand power of speech information became low in the 
circumference of these each cut point to a certain timeone by one. 
[Claim 3]As opposed to said video information photoed in accumulated video 
information or real time which consists of picture information and speech 
informationAn image summarizing method which reproduces an image from a point 
which only a certain time traced back from a point that asked for two or more cut 
points when a scene of a picture changes from said picture informationand power 
of speech information became low in the circumference of these each cut point to 
the point concerned that power of speech information became lowone by one. 
[Claim 4]The image summarizing method according to claim 3 which raises power 
as power of speech information corresponding to a point which starts reproduction 
is made low and it reproduceswhen reproducing an image. 

[Claim 5]The image summarizing method according to claim 2 which makes power 
of speech information low as a point which ends reproduction is approachedwhen 
reproducing an image. 

[Claim 6]An image abstract device comprising: 

An image accumulation part which accumulates a video input section which was 
photoed in real timeand which inputs video information which consists of picture 
information and speech informationand/or said video information. 
A cut check appearance part which asks for two or more cut points when a scene 
of a picture changes from said picture information. 

A point that power of speech information became low in the circumference of 
each of said cut pointand an image reproduction control sectionin which power of 
speech information includes a low power primary detecting element which detects 
************ | ow before and behind that. 

A point that power of speech information became low in the circumference of 
each of said cut pointa video output part which power of speech information is low 
and reproduces an image between ************s one by one before and behind 
thatand an interface part which controls setting out of a reproducing starting point 
of an imagea reproduction end pointor regeneration time. 

[Claim 7]An image abstract device comprising: 

An image accumulation part which accumulates a video input section which was 
photoed in real timeand which inputs video information which consists of picture 
information and speech informationand/or said video information. 
A cut check appearance part which asks for two or more cut points when a scene 
of a picture changes from said picture information. 

An image reproduction control section including a low power primary detecting 



element which searches for a point that power of speech information became low 
in the circumference of each of said cut point. 

A video output part which reproduces an image from a point that power of speech 
information became low in the circumference of each of said cut point to a certain 
timeone by oneand an interface part which controls setting out of a reproducing 
starting point of an imagea reproduction end pointor regeneration time. 

[Claim 8]An image abstract device comprising: 

An image accumulation part which accumulates a video input section which was 
photoed in real timeand which inputs video information which consists of picture 
information and speech informationand/or said video information. 
A cut check appearance part which asks for two or more cut points when a scene 
of a picture changes from said picture information. 

An image reproduction control section including a low power primary detecting 
element which searches for a point that power of speech information became low 
in the circumference of each of said cut point. 

A video output part which reproduces an image from a time of only a certain time 
going back from a point that power of speech information became low in the 
circumference of each of said cut point to the point concerned that power of 
speech information became lowone by oneand an interface part which controls 
setting out of a reproducing starting point of an imagea reproduction end pointor 
regeneration time. 

[Claim 9]An image abstract device which has an interface part which controls an 
image reproduction control section characterized by comprising the followinga 
video output part which reproduces an image in accordance with a selected 
regeneration methoddirections of a regeneration method of an imageand setting 
out of a reproducing starting point of an imagea reproduction end pointor 
regeneration time. 

An image accumulation part which accumulates a video input section which was 
photoed in real timeand which inputs video information which consists of picture 
information and speech informationand/or said video information. 
A cut check appearance part which asks for two or more cut points when a scene 
of a picture changes from said picture information. 

A low power primary detecting element which searches for a point that power of 
speech information became low in the circumference of said cut point. 
An image between a point that power of speech information became low in the 
circumference of each of said cut pointand a point that power of speech 
information became low before and behind that one by oneAn image by a certain 
time from the 1st regeneration method to reproduce and a point that power of 
speech information became low in the circumference of each of said cut point one 
by oneFrom the 2nd regeneration method to reproduce and a point which only a 
certain time traced back from a point that power of speech information became 
low in the circumference of each of said cut point. A regeneration method 



selecting part which at least two methods are beforehand set up among the 3rd 
regeneration method that reproduces an image to the point concerned that power 
of speech information became lowone by oneand chooses a directed regeneration 
method. 

[Claim 10]The image abstract device according to claim 8 or 9 containing a power 
adjustment part which raises power as said image reproduction control section 
makes low power of speech information corresponding to a point which starts 
reproduction when reproducing an image and is reproduced. 

[Claim 1 1]The image abstract device according to claim 7 or 9 containing a power 
adjustment part which makes power of speech information low as it approaches a 
point which ends reproductionwhen said image reproduction control section 
reproduces an image. 



DETAILED DESCRIPTION 



[Detailed Description of the Invention] 
[0001] 

[Industrial Application]This invention relates to an image summarizing method and 

a device. 

[0002] 

[Description of the Prior Art]Although the abstract art in consideration of the 
semantic contents of the image does not exist nowa certain physical quantity is 
extracted from an imageand the art of shortening the time at the time of taking 
out the portion which fulfills conditions with physical quantityand seeing an image 
exists. For exampleaccording to the difference quantity for every frame of a 
picturethe variation of luminosityand the variation of a color and a histogramit 
reproduces at high speed [ when there is little change ]The cut point of the 
method (Japanese Patent Application No. 3-20519 "image-contents compression 
display processing method") of reproducing at the rate of usualwhen there is much 
changeor an image is detectedThere are some (Japanese Patent Application No. 
5-15422 "method of creating the Hayami image and device") which realize slanting 
reading of an image by reproducing the image of immediately after a cut point or a 
just before for several secondsor fixing the length of the image corresponding to 
one shotand reproducing the image of fixed time. There is art of displaying the 
picture immediately after a cut point on a display side by side spatially as a still 
pictureor printing on paper and grasping the contents at a glance. 
[0003] 

[Problem(s) to be Solved by the Invention]In order to acquire required information 
efficiently out of the overflowing video informationit is necessary to see an image 
for a short time. Although there were some to which the speed reproduced 
according to the characteristic quantity extracted from picture information is 
changed conventionallywhen reproducing at high speedspeech information could 



not be addedbut the amount of information for grasping the contents was falling. It 
became a piece which a sound strikes with the method of reproducing only cut 
point order in the reproductive starting point and end point when reproducing an 
image since speech information is not taken into considerationand was unnatural. 
[0004]The purpose of this invention cancels the unnaturalness of the sound made 
when realizing Hayami of an imagewithout changing the reproduction speed of an 
imageand there is in providing the image summarizing method and device which are 
considered as an image abstract with shortening of regeneration time. 
[0005] 

[Means for Solving the Problem]As opposed to said video information photoed in 
accumulated video information or real time when an image summarizing method of 
claim 1 becomes from picture information and speech informationlt asks for two or 
more cut points when a scene of a picture changes from said picture 
informationand an image between a point that power of speech information 
became low in the circumference of these each cut pointand a point that power of 
speech information became low before and behind that is reproduced one by one. 
[0006]As opposed to said video information photoed in accumulated video 
information or real time when an image summarizing method of claim 2 becomes 
from picture information and speech informationlt asks for two or more cut points 
when a scene of a picture changes from said picture informationand an image from 
a point that power of speech information became low in the circumference of 
these each cut point to a certain time is reproduced one by one. 
[0007]As opposed to said video information photoed in accumulated video 
information or real time when an image summarizing method of claim 3 becomes 
from picture information and speech informationlt asks for two or more cut points 
when a scene of a picture changes from said picture informationand an image from 
a point which only a certain time traced back from a time of power of speech 
information becoming low in the circumference of these each cut point to the 
point concerned that power of speech information became low is reproduced one 
by one. 

[0008]Power of speech information corresponding to a point which starts 
reproduction is made lowand when reproducing an imagepower can be raised in an 
image summarizing method of claim 3 as it reproduces. 

[0009]In an image summarizing method of claim 2when reproducing an imagepower 
of speech information can be made small as a point which ends reproduction is 
approached. 

[0010]An image accumulation part which accumulates a video input section into 
which an image abstract device of claim 6 inputs video information which was 
photoed in real timeand which consists of picture information and speech 
informationand/or said video informationA cut check appearance part which asks 
for two or more cut points when a scene of a picture changes from said picture 
informationA point that power of speech information became low in the 
circumference of each of said cut pointand an image reproduction control 
sectionin which power of speech information includes a low power primary 



detecting element which detects ************ low before and behind thatlt has a 
video output part which reproduces an image between a point that power of 
speech information became low in the circumference of each of said cut pointand 
a point that power of speech information became low before and behind thatone 
by oneand an interface part which controls setting out of a reproducing starting 
point of an imagea reproduction end pointor regeneration time. 
[001 1]An image accumulation part which accumulates a video input section into 
which an image abstract device of claim 7 inputs video information which was 
photoed in real timeand which consists of picture information and speech 
informationand/or said video informationA cut check appearance part which asks 
for two or more cut points when a scene of a picture changes from said picture 
informationAn image reproduction control section including a low power primary 
detecting element which detects a point that power of speech information became 
low in the circumference of each of said cut pointlt has a video output part which 
reproduces an image from a point that power of speech information became low in 
the circumference of each of said cut point to a certain timeone by oneand an 
interface part which controls setting out of a reproducing starting point of an 
imagea reproduction end pointor regeneration time. 

[0012]An image accumulation part which accumulates a video input section into 
which an image abstract device of claim 8 inputs video information which was 
photoed in real timeand which consists of picture information and speech 
informationand/or said video informationA cut check appearance part which asks 
for two or more cut points when a scene of a picture changes from said picture 
informationAn image reproduction control section including a low power primary 
detecting element which searches for a point that power of speech information 
became low in the circumference of each of said cut pointA video output part 
which reproduces an image from a time of only a certain time going back from a 
point that power of speech information became low in the circumference of each 
of said cut point to the point concerned that power of speech information became 
lowone by onelt has an interface part which controls setting out of a reproducing 
starting point of an imagea reproduction end pointor regeneration time. 
[0013]An image accumulation part which accumulates a video input section into 
which an image abstract device of claim 9 inputs video information which was 
photoed in real timeand which consists of picture information and speech 
informationand/or said video informationA cut check appearance part which asks 
for two or more cut points when a scene of a picture changes from said picture 
informationA low power primary detecting element which detects a point that 
power of speech information became low in the circumference of each of said cut 
pointAn image between a point that power of speech information became low in 
the circumference of each of said cut pointand a point that power of speech 
information became low before and behind that one by oneAn image by a certain 
time from the 1st regeneration method to reproduce and a point that power of 
speech information became low in the circumference of each of said cut point one 
by oneFrom the 2nd regeneration method to reproduce and a point which only a 



certain time traced back from a point that power of speech information became 
low in the circumference of each of said cut point. An image reproduction control 
section in which at least two methods are beforehand setand contain a 
regeneration method selecting part which chooses a directed regeneration method 
among the 3rd regeneration method that reproduces an image to the point 
concerned that power of speech information became lowone by oneA video output 
part which reproduces an image in accordance with a selected regeneration 
methoddirections of a regeneration method of an imageand a reproducing starting 
point of an imagelt has an interface part which controls setting out of a 
reproduction end point or regeneration time. 

[0014]In an image abstract device of claim 8 or 9when an image reproduction 
control section reproduces an imageit can make low power of speech information 
corresponding to a point which starts reproductionand can contain a power 
adjustment part to reproduce and which is aliketherefore raises power. 
[0015]In an image abstract device of claim 7 or 9when an image reproduction 
control section reproduces an imageit can contain a power adjustment part which 
makes power of speech information low as it approaches a point which ends 
reproduction. 
[0016] 

[Function]The image summarizing method of claim 1 can cancel the piece which an 
unnatural sound beatswhen the power of speech information uses for the 
reproducing starting point and end point of an image the portion which became low. 
[0017]The image summarizing method of claim 2 can cancel the piece which the 
unnatural sound in the reproducing starting point of an image beatswhen the power 
of speech information uses for the reproducing starting point of an image the 
portion which became low. 

[0018]The image summarizing method of claim 3 can cancel the piece which the 
unnatural sound in the reproduction end point of an image beatswhen the power of 
speech information uses for the reproduction end point of an image the portion 
which became low. 

[0019]Since the length of each reproducing section can be determined 
arbitrarilythe image summarizing method of claims 2 and 3 is effective when 
regeneration time is restricted. 

[0020]When reproducing an imagethe image summarizing method of claim 4 makes 
low power of the speech information corresponding to the point which starts 
reproductionand it can cancel the piece which the unnatural sound in the 
reproducing starting point of an image beats by raising power as it reproduces. 
[0021]When the image summarizing method of claim 5 reproduces an imageit can 
cancel the piece which the unnatural sound in the reproduction end point of an 
image beats by making power of speech information low as it approaches the point 
which ends reproduction. 

[0022]The image abstract device of claims 67and 8 is a device which enforces the 

image summarizing method of claims 12and 3respectively. 

[0023]The abstract device of claim 9 enables it to choose one regeneration 



method from two or more regeneration methods with one device. 

[0024]The image abstract device of claims 10 and 1 1 is a device which enforces 

the image summarizing method of claims 4 and 5respectively. 

[0025] 

[Example]Nextthe example of this invention is described with reference to 
drawings. 

[0026]The flow chart with which drawing 1 shows the block diagram of the image 
abstract device of one example of this inventionand drawing 2 shows the flow of 
processing of the image reproduction control section 103and drawing 3 are the 
figures showing an example of the interface part 109. 

[0027]The video input section 101 into which the image abstract device of this 
example inputs an imageand the image accumulation part 102 which accumulates 
the image inputted in real time and outputs the accumulated image in real timelt 
comprises the image reproduction control section 103 which controls the image to 
reproducethe video output part 108 which reproduces an image and is 
outputtedand the interface part 109 to which a user points to a regeneration 
methodand a reproducing starting pointa reproduction end point and regeneration 
timeand shows a user the reproduced image. The cut check appearance part 104 
from which the image reproduction control section 103 detects the cut point when 
the scene (shot) of an image changesThe power of the speech information of an 
image comprises the power adjustment part 106 which adjusts the power of the 
speech information of the low power primary detecting element 105 which detects 
a low portionthe point which starts reproductionand the point to endand the 
regeneration method selecting part 107 which chooses regeneration time and a 
regeneration method. The interface part 109 comprises the display 401 which 
projects an imageregeneration method selection button 402 , - 402 3 and the 
regeneration time selection knob 403. 

[0028]Herethe following three kinds are set as the regeneration method selecting 
part 107 as a regeneration method. 

(1) Reproduce the image between the point that the power of speech information 
became low in the circumference of each cut point of an imageand the point that 
the power of speech information became low before and behind thatone by one. 

(2) Reproduce a certain time and an image one by one from the time of the power 
of speech information becoming low in the circumference of each cut point of an 
image. 

(3) Reproduce the image from the point which only a certain time traced back from 
the time of the power of speech information becoming low in the circumference of 
each cut point of an image to the point concerned that the power of speech 
information became lowone by one. 

[0029]Nextoperation of this example is explained. 

[0030]The image inputted by the video input section 101 in real time is sent to the 
image reproduction control section 103being accumulated in the image 
accumulation part 102. In the case of the image accumulated beforehandit is 
directly sent from the image accumulation part 102 at the image reproduction 



control section 103. Nextprocessing shown by drawing 2 is performed based on 
the regeneration method to which the user pointed by the interface part 109. 
Firstcut check appearance processing 201 is performed to an image by the cut 
check appearance part 104. The method by "the video-cuts check attitude 
method and device" of Japanese Patent Application No. 5-317663etc. are used for 
detection of a cut point. Nextlow power detection processing 202 is performed in 
the low power primary detecting element 105and the point that the power of 
speech information became low in the circumference of a cut point is searched for. 
The value of the sum of squares of the voice waveform in every several msetc. 
are used for detection of low power. Since regeneration time cannot specify 
strictly when using the low power part of speech informationprocessing is 
performed so that a low power part which becomes the closest to the specified 
regeneration time may be used. In not using a low power part for the end of 
reproduction of an imageand a reproducing starting point (a regeneration method 
(2)(3))weighting as shown in 301 and 302 of drawing 3 is performed in the power 
adjustment part 106 (power adjustment processing 203)and it reduces the piece 
which an unnatural sound beats. Although power adjustment is performed with the 
primary function at drawing 3 it is also possible to attach a smooth change like a 
sigmoid function. After reproduction pretreatment 204 of adjustment of 
regeneration timebuffering of each image sectionetc. is performed by the 
regeneration method selecting part 107an image is reproduced in the video output 
part 108. The reproduced image is projected on the display 401 of the interface 
part 109. 

[0031]Although it enabled it to choose two or more regeneration methods in this 
exampleit may be made to perform only any one regeneration method. It may be 
made to include the display 401 in the video output part 108. At least either of the 
video input section 101 and the image accumulation part 102 is good. 
[0032] 

[Effect of the InventionjAs explained abovethis invention has an effect as taken 
below. 

(1) The invention of claims 16and 9 can cancel the piece which the unnatural 
sound in the reproducing starting point and end point of an image beatswhen the 
power of speech information uses for the reproducing starting point and end point 
of an image the portion which became low. 

(2) The invention of claims 27and 9 can cancel the piece which the unnatural 
sound in the reproducing starting point of an image beatswhen the power of 
speech information uses for the reproducing starting point of an image the portion 
which became low. 

(3) The invention of claims 38and 9 can cancel the piece which the unnatural 
sound in the reproduction end point of an image beatswhen the power of the 
speech information of the reproduction end point of an image uses the portion 
which became low. 

(4) When the invention of claims 4 and 10 reproduces an imageit can cancel the 
piece which the unnatural sound in the reproducing starting point of an image 



beats by raising power as power of the speech information corresponding to the 

point which starts reproduction is made low and it reproduces. 

(5) When the invention of claims 5 and 1 1 reproduces an imageit can cancel the 

piece which the unnatural sound in the reproduction end point of an image beats 

by making power of speech information low as it approaches the point which ends 

reproduction. 



DESCRIPTION OF DRAWINGS 



[Brief Description of the Drawings] 

[Drawing 1] It is a block diagram showing the outline composition of the image 
abstract device of one example of this invention. 

[Drawing 2] It is a flow chart which shows the flow of processing of the image 
reproduction control section 104 of drawing 1 . 

[Drawing 3] It is a figure showing a situation when weighting is given to the power 
of the speech information of the reproducing starting point by this exampleand an 
end point. 

[Drawing 4] It is a figure showing the composition of the interface part 109 of 
drawing 1 . 

[Description of Notations] 

101 Video input section 

102 Image accumulation part 

1 03 Image reproduction control section 

104 Cut check appearance part 

1 05 Low power primary detecting element 

106 Power adjustment part 

107 Regeneration method selecting part 

108 Video output part 

109 Interface part 

201 Cut check appearance processing 

202 Low power detection processing 

203 Power adjustment processing 

204 Reproduction pretreatment 
301302 weighting 

401 Display 

402 ! - a 402 3 regeneration method selection button 

403 Regeneration time selection knob 
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